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Improved Checgcrs Inequality and Analysis of Local Graph 
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Abstract 

We prove two generalizations of the Cheeger’s inequality. The first generalization relates the 
second eigenvalue to the edge expansion and the vertex expansion of the graph G, 

a 2 = SK4> V (G) • 0(G)), 

where <f) V (G) denotes the robust vertex expansion of G and <f>{G) denotes the edge expansion 
of G. The second generalization relates the second eigenvalue to the edge expansion and the 
expansion profile of G, for all k > 2, 


Aa = n(2 - 0*(G) - 0(G)), 

where <pk(G) denotes the k -way expansion of G. These show that the spectral partitioning algo¬ 
rithm has better performance guarantees when <f> v (G) is large (e.g. planted random instances) 
or 4>k{G) is large (instances with few disjoint non-expanding sets). Both bounds are tight up to 
a constant factor. 

Our approach is based on a method to analyze solutions of Laplacian systems, and this 
allows us to extend the results to local graph partitioning algorithms. In particular, we show 
that our approach can be used to analyze personal pagerank vectors, and to give a local graph 
partitioning algorithm for the small-set expansion problem with performance guarantees similar 
to the generalizations of Cheeger’s inequality. We also present a spectral approach to prove 
similar results for the truncated random walk algorithm. These show that local graph parti¬ 
tioning algorithms almost match the performance of the spectral partitioning algorithm, with 
the additional advantages that they apply to the small-set expansion problem and their running 
time could be sublinear. Our techniques provide common approaches to analyze the spectral 
partitioning algorithm and local graph partitioning algorithms. 
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1 Introduction 


Let G = (V,E) be a complete weighted graph and n := \V\. For simplicity, we assume that the 
graph is regular and the total weight on each vertex is one throughout 1 . Let w(S,T) be the total 
weight of the edges with one vertex in S and another vertex in T. The edge expansion of a set 
S C V and the edge expansion of a graph G are defined as 

0(<S) := — and 0(G) := min 0(S). 

|<Sj S:|S|<|V|/2 

Let L = I — A be the Laplacian matrix of G where I and A are the identity and the adjacency 
matrix of G, with eigenvalues of L being 0 = Ai < A 2 < ... < A n < 2. Cheeger’s inequality [Che70, 
AM85, AI 086 ] bounds the edge expansion of G using the second eigenvalue of L, 

-A 2 < 0(G) < \/ 2 A 2 . 

It is useful in bounding the edge expansion of a graph and also bounding the mixing time of 
random walks [HLW06]. The proof of Cheeger’s inequality gives an efficient algorithm to find a 
set with expansion at most y/ 2 \ 2 , and we will refer to this algorithm as the spectral partitioning 
algorithm (also known as the sweep cut algorithm on the second eigenvector). A recent generaliza¬ 
tion [KLLOT13] of Cheeger’s inequality bounds the edge expansion of G using the second and the 
fc-th eigenvalues of L for any k > 2, 


0(G) = 0(k)—^=. 

This provides a better analysis of the spectral partitioning algorithm in practical instances of image 
segmentation and data clustering. 

1.1 Our results 

We prove two new generalizations of Cheeger’s inequality. These provide better analyses of the 
spectral partitioning algorithm when some expansion parameters of the graph are large. We also 
prove similar bounds for the personal pagerank algorithm and the truncated random walk algo¬ 
rithm. These give local graph partitioning algorithms for the small-set expansion problem with 
improved Cheeger’s guarantees. Our techniques provide common approaches to analyze the spec¬ 
tral partitioning algorithm and local graph partitioning algorithms. 

1.1.1 Vertex Expansion 

The first generalization bounds the second eigenvalue of L by the edge expansion and the vertex 
expansion of G. We define the robust vertex expansion following Kannan, Lovasz and Montene- 

1 By standard arguments, the results can be extended to handle non-regular graphs using the notion of conductance 
and the normalized Laplacian matrix. 
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gro [KLM06]. For Scy, let N 1/2 (S) := min{|T| | T C V - 5 and w(S,T) > \ w(S,S)j. Define 

(f) V (5) ■= — - and 4> V (G) := min 0 y (5) 

|5| V S:|S|<|V|/2 

as the robust vertex expansion 2 of G. Also define 

V(S) := 0(5) -0 y (5) and T(G) := min V(S) 

S:|S|<|V|/2 

as the minimum product of the edge expansion and the robust vertex expansion. The following is 
a generalization of Cheeger’s inequality using robust vertex expansion. 

Theorem 1. 

A 2 = D(min{T(G),0(G)}). 

Corollary 1. 

a 2 = D(0 y (G) • 0(G)). 

Note that 0 y (5) > |0(5) and so Corollary 1 is a generalization of Cheeger’s inequality. Observe 
that 0 y (5) could be much larger than 0(5) when the edges crossing 5 spread out. For exam¬ 
ple, randomly generated instances such as those in the planted partition model [Bop87, McSOl] 
have 0 y (G) = D(l), and thus Theorem 1 implies that the spectral partitioning algorithm is a 
constant factor approximation algorithm for those instances 3 . Another interesting example is the 
hypercube 4 . 

1.1.2 Expansion Profile 

The 5-small-set expansion (0 < 8 < 1 / 2 ) of G and the A;-way expansion (k > 2) of G are defined as 

d>s(G) := min 0(5) and 0GG) := min max 0(50. 

V ' 5:|5|<5|V0 V V ^ Si,...,S fc : S i nSj=0 l<i<fc V 

The curve 0,5 (G) for 0<<5<l/2is defined by Lovasz and Kannan [LK99] and is called the 
expansion profile of G. Note that 0(G) = 0i/ 2 (G) = 0 2 (G). The following is a generalization of 
Cheeger’s inequality using k -way expansion. 

2 Note that the usual definition of vertex expansion, define as inins,|,g|<|y |/2 1V(iS i )/|iS'|, is too sensitive to edges of 
tiny weights (e.g. adding a complete graph with tiny edge weight will change (j> v (G) to one). One could replace the 
constant 1/2 in the definition of N 1 / 2 (S) by other constant say 0.99 so that the definition of robust vertex expansion 
is closer to the definition of (ordinary) vertex expansion while we can still obtain similar results. 

3 For example, in a planted fc-partition instance where there are k subsets of size n/k with probability p having 
an edge between two vertices in the same subset and probability q having an edge between two vertices in different 
subsets for q <g p, the improved Cheeger’s inequality only proves a 0(fc)-approximation while Theorem 1 proves a 
0(l)-approximation. 

4 For hypercubes, it is known that the edge expansion is fi(l/logn), the vertex expansion is f2(l/v/l°g n) fHar66], 
and the product of the edge expansion and the vertex expansion is 12(1/log n) [Mar74]. We believe that the same 
bounds hold for robust vertex expansion, </) V (G) = Hfl/ydog ~n) and *F(G) = 12(l/logn) but we don’t know of a 
proof yet. If that’s true, Corollary 1 will give a bound of 12(1/log 322 (n)) on the second eigenvalue, and Theorem 1 
will give the correct bound of 12(1/log(n)), while Cheeger’s inequality only gives a bound of 12(1/log 2 n). 
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Theorem 2. For all k >2, 

a 2 = • MG) • 4>{G)). 

Corollary 2. For all 5 < 1/2, 

A 2 = n{6-MG)-<KG)). 

Both Theorem 1 and Theorem 2 are tight up to a constant factor. Both proofs of Theorem 1 
and Theorem 2 show that the spectral partitioning algorithm achieves the performance guarantees, 
i.e. the algorithm would output a set S with 'h(S) = 0( A 2 ) and fi(S) = 0(fcA 2 /^(G)) respec¬ 
tively. These imply that the spectral partitioning algorithm is a 0(1/(jfi (G) ^approximation and a 
0(k/4>k(G) ^approximation for edge expansion. 

1.1.3 Local Partitioning Algorithms for Small-Set Expansion 

Our proof techniques allow us to use the same approach to analyze the local partitioning algorithm 
using personal pagerank vectors [ACL06]. Given a parameter a € (0,1] and a vertex s, the personal 
pagerank vector r S:CX € W 1 is the unique solution to the equation r s>a = ay s + (1 — a)Wr s>a , where 
W is the transition matrix of the lazy random walks. 

Theorem 3. For any (unknown target) set S C V, there is a polynomial time randomized algorithm 
to find a set S' with 

1. fi(S') = OMS) log(|S|)/^(G)) and |S'| = 0(\S\ log |5|), 

2. fi(S') = 0(kfi(S) \og(\S\)/MG)) and |5'| = 0(|S| log |5|), 

by computing r s>a for a random vertex s € S with a = 0(4>(S)) and returning a level set of r s>a . For 
unweighted d-regular graphs, there is a local implementation with running time 0(d\S\ log(|5|)/0(5)+ 
\S\ log 2 |S'|). 

Theorem 3 implies that the personal pagerank algorithm is a 0(log(|S , |)/</>' / (G))-approximation 
and a 0(k log(|S| )/</>& (G))-approximation for the small-set expansion problem where the output 
set size is bounded within a logarithmic factor of the target set size. 

We also present a spectral approach to prove that the local graph partitioning algorithm using 
truncated random walks has similar performance guarantees as the spectral partitioning algorithm. 
Let p s> t ■= W t Xs be the probability distribution vector after t steps of lazy random walks starting 
from the vertex s. 

Theorem 4. For any (unknown target) set S C V , there is a polynomial time randomized algorithm 
to find a set S with 

1. fi(S') = 0(kfi(S)/(eMG ))) and |S'| = 0(|S| 1+e ) ; 

2. fi(S') = 0(kcf(S)\og(\S\)/MG)) and |5'| = 0(|5|), 
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by computing p s ^ for a random vertex s € S with t = ©(logdS'D/^S)) for (1) and t = 0(l/(f>(S)) 
for (2) and returning a level set of p s j- For unweighted d-regular graphs, there is a local implemen¬ 
tation with running time 0(de 2 \S\ 1+e log 2 (|Sj)/<^(S') 3 ). 

Theorem 4 implies that the truncated random walks algorithm is a 0(k/cj>k(G))- approximation or 
a 0(felog(|S|)/(/>fc(G))-approximation for the small-set expansion problem, with different tradeoffs 
of the output set size. 

Our results provide improved analyses of local graph partitioning algorithms when the vertex ex¬ 
pansion or the k- way expansion is large, and provide theoretical justification of their good empirical 
performances in applications such as image segmentation and data clustering (see [ZLM13] and the 
references therein). The results show that the performances of local graph partitioning algorithms 
almost match that of Theorem 1 and Theorem 2 (within at most a O (log (| S |)) - fact or in the ap¬ 
proximation guarantee), with the additional advantages that they apply to the small-set expansion 
problem (giving bicriteria approximations for cj)$(G )) and also that their running time could be 
sublinear in the graph size (when d and |Sj are small enough). 

1.2 Comparisons with Related Work 
1.2.1 Generalizations of Cheeger’s inequality 

There are several recent generalizations of Cheeger’s inequality using higher eigenvalues of the 
Laplacian matrix. The first generalization by Arora, Barak and Steurer [ABS10] relates higher 
eigenvalues to small-set expansions: 

0O(fc- 1 /ioo)(G') = 0{\/\k log fc n), 

and they use it to design a subexponential time algorithm for approximating unique games. The 
second generalization by Louis et al. [LRTV12] and Lee et al. [LOT12] relates higher eigenvalues 
to k-way expansion (a stronger requirement than small-set expansion): 

k < MG) < 0(VA 2 fclogA;), (1-1) 

and this justifies the use of higher eigenvalues in k- way graph partitioning. Then there is a gener¬ 
alization by Kwok et al. [KLLOT13] relating higher eigenvalues to the ordinary edge expansion: 

<KG) < 0(fc)4j=, (!-2) 

which shows that the spectral partitioning algorithm performs better in instances with A f. large for 
a small k. 

Instead of using higher eigenvalues to give better bounds on expansion parameters, our results 
use expansion parameters to give better bounds on the second eigenvalue. We remark that the 
techniques developed in [KLLOT13] could be used to prove Theorem 2 (see Section A in the 
Appendix), but our approach is quite different and could be used to prove Theorem 1 and to 
extend Theorem 2 to analyze personal pagerank vectors. We also note that our proof of Theorem 2 
can be used to prove (1.2) using a graph powering trick as described in [KL14] (see Section A in 
the Appendix). 
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1.2.2 Local Graph Partitioning Algorithms 

Local graph partitioning algorithms are useful in finding a small non-expanding set in a large 
graph, as their running times are only weakly dependent on the graph size and could be sub- 
linear time. All known algorithms are based on some random walks related processes. The 
first local graph partitioning algorithm is a truncated random walk algorithm by Spielman and 

Teng [ST13], which returns a set S' with 4>(S') = 0{yj(j>(S) log 3 n) with work-to-volume ratio 
O(polylog(?r)/0 2 (S')). The second algorithm is a personal pagerank algorithm by Andersen, Chung 
and Lang [ACL06], which returns a set S' with cj>(S') = 0(^4>(S) log(|5|)) with work-to-volume ra¬ 
tio 0(polylog(n)/^>(S 1 )). The evolving set process is used by Andersen and Peres [AP09] to further 
improved the work-to-volume ratio to 0(polylog(n)/-^/ (j>(S)) while having the same performance 
guarantee as in [ACL06]. Using a better analysis of the escaping probability of random walks, Oveis 
Gharan and TYevisan [OT12] (see also [KL12]) showed that the ylog(|S|) factor in the performance 
ratio can be removed, thereby almost matching the guarantee of Cheeger’s inequality. They com¬ 
bined this with the evolving set process to find a set S' with 4>(S') = <j)(S)/e), |S"| = 0(|S’| 1+e ) 

and work-to-volume ratio 0(|5| e polylog(n)/v^). 

Our contribution is to show that the performance of some simple local graph partitioning algo¬ 
rithms (truncated random walks, personal pagerank) almost match that of the improved Cheeger’s 
inequalities. These provide the first analyses showing that random walk based algorithms perform 
better when (f) V (G), 4>k{G ) or A k(G) is large, with similar performances to the spectral partitioning 
algorithm while having additional features. We note that Zhu et al. [ZLM13] gave a better analysis 
of the personal pagerank algorithm when the internal expansion of the target set is large; our results 
are related but incomparable. 

1.2.3 Analysis of Mixing Time 

The notion of expansion profile was introduced by Lovasz and Kannan [LK99] in the study of mixing 
times of random walks. They proved that the mixing time is upper bounded by f x ^ x yi where 
d>(x) = mino< 5 < a; 4>s, which is a better bound on the mixing time when the average conductance is 
large (e.g. small sets expand well in geometric graphs). 

Our work is inspired by their paper and some subsequent work [KLM06, MP05], both in the proof 
techniques (will be discussed in the next subsection) and in the definitions. The robust vertex 
expansion and its expansion profile are studied in [KLM06], where better bounds on the mixing 
time are proved in a similar form to the average conductance bound above. In particular, it implies 
the mixing time is bounded by 0(log(n)/T(G)), and thus A 2 > f2(\H(G)/log(n)). We note that 
Morris and Peres also proved a lower bound on the second eigenvalue (Theorem 15 in [MP05]) using 
a parameter related to vertex expansion, but their definition is incomparable to ours. 

Our contribution is to directly bound the second eigenvalue (not the mixing time) using the ex¬ 
pansion parameters and our bounds are independent of n. Also, the bound that we prove us¬ 
ing 4>k is considerably stronger. Using (j>s, the average conductance bound only gives I/A 2 < 
0(log(5n)/<^ 2 + log(l/5)/<^> 2 ), not improving on Cheeger’s inequality even when efts = 12(1) for 
constant 5, while Corollary 2 gives a 0(l)-approximation when (f>$ = 12(1) for constant 5. 
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1.3 Technical Overview 


The proofs are inspired by the work of Lovasz and Kannan [LK99] . We observe that their method 
is useful in analyzing the solution to a Laplacian system (Lx = b ), and can be extended to study 
both the second eigenvectors (Lx = Xx) and the personal pagerank vectors. 

The high-level approach is to look at the solution vector x € R n with x\ > X2 > ... > x n , and 
relates the (slow) decrease of Xi to the (large) expansion of the level sets in this vector. Similar 
to [LK99], we define a jumping sequence of indices 1 = mo, mi, m 2 , ■ ■ ■ such that x mi — x mi+1 is 
inversely proportional to the expansion of the level set [l,m*] (see Lemma 1). Using the Laplacian 
equation of the second eigenvector, we use an inductive argument to show that if the expansion 
of all level sets is fl(\/A^), then the values of Xj decrease slowly enough such that x n / 2 > 0 (see 
Lemma 2), contradicting that x is orthogonal to the all-one vector. We remark that this gives 
a new and quite different proof of Cheeger’s inequality (e.g. without using the Cauchy-Schwarz 
inequality). To prove Theorem 1, we use the robust vertex expansion to argue that each jump can 
be made longer (m,+1 — m; made larger) and this gives the improved bound. To prove Theorem 2, 
we argue that given an ordering of the vertices, if <pk is large, then there are only a small number 
of indices in the jumping sequence whose corresponding level sets [1, m,] are of small expansion 
(see Lemma 3), and then we modify the induction hypothesis to obtain the result (see Lemma 4). 
The inductive arguments and the use of </>& in arguing about expansions of level sets are the new 
elements in the proofs that improve upon the average conductance bound of Lovasz and Kannan. 
The previous analyses of both the truncated random walk algorithm [ST 13] and the personal 
pagerank algorithm [ACL06] are based on the combinatorial technique introduced by Lovasz and 
Simonovits [LS90] in analyzing the mixing time of random walks. This technique is quite different 
from the analysis of spectral partitioning algorithms. It requires to consider the random walk 
vectors for many different time steps, and it is difficult to incorporate the notions of cj>k or Afc in 
the analyses as the ordering and level sets are changing in each time step 5 . 

Our techniques provide two approaches to lift the analysis of the spectral graph partitioning al¬ 
gorithm for local graph partitioning algorithms, bringing closer the analyses of these two types of 
algorithms. For the personal pagerank algorithm, we use the Lovasz-Kannan approach to directly 
analyze the vector so that we can use </>& to reason about the level sets (Lemma 3). We note 
that this approach is considered by Andersen and Chung to give a simplified proof of the personal 
pagerank algorithm [AC07], and we will reuse some of their lemmas to obtain Theorem 3. For the 
truncated random walk algorithm, we use the spectral approach of Arora-Barak-Steurer [ABS10] 
to directly obtain a vector with small Rayleigh quotient and small support, so that the improved 
Cheeger’s inequalities can be applied to obtain results for approximating small-set expansions 6 . 
Finally, we remark that this approach can be applied to analyze the solutions to other Laplacian 
systems. Consider the following algorithm for approximating edge expansion. For an unknown 
target set S, pick a random vertex s € S, inject n units of current to s and extracts one unit of 
current from every vertex in the graph, sort the vertices by the voltages', and output the level set 
with the smallest expansion among all level sets of size up to n/2. Our approach implies that this 

5 We still don’t know how to do a better analysis for the evolving set process because of this difficulty. 

6 We thank David Steurer for suggesting this spectral approach. 

'Or equivalently, sort the vertices based on the expected hitting time to s. 
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algorithm always outputs a set S' with f(S') = 0{^<j>(S) log n). We believe that this approach 
draws more connections to the mixing time literature and will find further applications. 


2 Spectral Partitioning 

Let A := A 2 and x be a second eigenvector such that Lx = Ax. Sort the vertices so that x\ > X 2 > 
■ ■ ■ L x n . 


2.1 Vertex Expansion 

The proof of Theorem 1 consists of two steps. The first step is to prove the drop lemma and then 
define a jumping sequence to apply the lemma. The second step is to use an inductive argument 
to derive a contradiction if the expansion of all level sets are large. 


2.1.1 Drop Lemma and Jumping Sequence 

The following lemma bounds the decrease of the values in x to the expansion of the level sets of x. 
Recall that w(S, T ) denotes the total weight of the edges with one vertex in S and another vertex 
in T. 

Lemma 1 (Drop Lemma). For 1 < a < b < n, we have 

A E“=i 


x a ~x b < 


u;([l,a], [6, n])' 


Proof. For each i, 


Xi — ^2 WijXj = A X{. 


Sum this equation for 1 < i < a, we have 


IZ XiWi i ~ S w = a Xi - 

i =1 j i= 1 j 


i— 1 


Since Wij = Wji, this can be simplified to 


J2 XiW P ~ x i Wi i = A Xi - 

i<a j>a i<a j>a 


i —1 


Consider the edges from the set [1, a] to the set [b, n}. Each edge contributes (xj — Xj)wij to the 
left hand side, which is at least (x a — x b )wij. Therefore, we have 


u;([l,a], [b,n]) ■ (x a - x b ) < A E Xi and thus x a — x b < 


i= 1 


A E°=l x i 

w([l,a], [b,n]) 


□ 
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We define a jumping sequence of indices to apply the drop lemma. Let mo = 1 and 

m i+ 1 = \rm(l +(f) v 


wher e 4> v (rrii), (j)(mi), (rrii) are shorthands for </> y ([l, rrij]), <^([1, m;]), ^([l, mf\) respectively. Then, 
for rrij < n/2, by the definition of 4> v {m ^), we have 

«;([!,m*], [m i+ i,n]) > -m* • 0(mj). 


Putting it in the above inequality with a = m, and 6 = m *+ 1 , it follows that 


Xm.A X' 


m i+ 1 < 


2Ax. 


mi 




1 


where X/ := y Xj. 


( 2 . 1 ) 


2 — 1 


Note that xi is non-increasing over l. 


2.1.2 Induction 


We will prove the following lemma by induction. 

Lemma 2. If ’P(mj) > 32A and (/>{rrii) > 32A /or all mi < n/2, f/ien x mi+1 < 2x m;+1 for all 
mi < n/2. 


First we see how it implies Theorem 1. Let rrij be the first term in the jumping sequence such that 
rrij > n/2. Note that the assumptions of Lemma 2 would imply that Xj > ^Xj > 0, where the 
last inequality follows because Ei=i x i = ® ( as the second eigenvector is orthogonal to the all-one 
vector) and so all partial sums are positive. But this implies that x* > 0 for all 1 < i < n/2, 
and applying the same argument to — x will give us a contradiction. Therefore, the assumptions of 
Lemma 2 must not hold, and thus there is an rn t < n/2 with f(mj) < 32A or <f>{mi) < 32A, proving 
Theorem 1. 

Now we proceed to prove Lemma 2. It is clear that the inequality holds for mo- Assume that 
x mi < cxrm where c = 2, 8 we would like to prove that x mi+1 < cx mi+1 - Note that 

m i+l mi mi + 1 

Xj = Xj + y; X* < m.jx mi + (m i+ i - mi)x mi < x mi (m i+ 1 + (c- l)mj). 

2— 1 2—1 2=777^ + 1 


Dividing both sides of this inequality by m l+ \, we have 


x. 


mi+l 


< x mi (l + (c-1) 


mi . (c — 1) 

m-i+i 1 + (f v {mi) 


c + 4> v (mi) <j>(mi) 

< Xm i+1 { 


T + </> y (mj)' f>(mi)—2\c 


) < CX mj+1 , 


where the second inequality follows from the definition of myi, the third inequality is by (2.1), and 
the last inequality follows from the following claim by plugging in <p v {mf) for h and <f>{mi) for ip. 
Note that the conditions of Claim 1 follows from the assumptions of Lemma 2, and this completes 
the proof. 


The variable c is used so that we can reuse the calculation here for the proof of Theorem 2. 









Claim 1 . If 2 < c < 4, 32A < hip and 32A < ip, then we have 


t c + h \( V w 


Proof. The conclusion to check is 


c >(^)t^) = (-^)(i+ 2Ac 


which is equivalent to 


1 + h (p — 2\c 1 + h p — 2\c ' 


(c—1 )h 2A c 2 (c — l)/r(2Ac) 


1 + h ip — 2Ac (1 + h)(ip — 2Ac) 
Since <£> > 32A > 2cA, this is equivalent to 


0 > —(c — 1 )h(ip — 2Ac) + 2Ac 2 (1 + h) — (c — l)h(2Xc), 
which can be simplihed to 

£ _l> 2A(l + ft ) = 2A ( 1 + 1) 

c z hip hip ip 

Since 2 < c < 4, the left hand side is at least 1/8. We consider two cases. The first case is when 
1 /{hip) > 1 /p, and so the right hand side is at most 4A /{hip). We have 1/8 > 4A /{hip), as long as 
hip > 32A, which is satisfied by our assumption. The second case is when 1 /(hip) < 1 /ip, and so the 
right hand side is at most 4A /ip. We have 1/8 > 4A/<^, as long as p > 32A, which is also satisfied 
by our assumption. □ 


2.2 Proof of Theorem 2 

We follow the same approach to prove Theorem 2. The additional arguments are in Lemma 3 to 
bound the number of terms in the jumping sequence with small expansion using ff. and in Claim 2 
to control the inductive bound dynamically. 

For Theorem 2, we define the jumping sequence as follows. Let mo = 1 and 

1 

m i+ 1 = |m.j(l + -0(mi))]. 


Then, for m* < n/2, we have 


w([l,mi], [m i+1 ,n]) > m^m*) - (m i+ i - m, - 1) > -m* • 
so that equation (2.1) still holds after applying the drop lemma. 
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2.2.1 k- way Expansion 


The assumption on <j>k allows us to bound the number of terms in the jumping sequence with small 
expansion. We note that the following lemma can be applied to any ordering of vertices (not just 
for second eigenvector), and it will be applied to personal pagerank vectors later. 

Lemma 3. For any 9 < fa/ 4, there are at most 16 k/fa terms mi in the jumping sequence with 
9 < (t>(mi) < 29. 


Proof. Suppose by contradiction that there are at least 16 k/fa such terms. Let yo be the first such 
term and let y* be the (l§i/fa)-th. such term. We claim that the sets {[1, yo], [yo, yi],..., [yk-uVk]} 
are all of expansion less than fa, contradicting the definition of 4>k ■ Note that 

9 is 8 9 8 9yi 

Vi+i > 2M 1 + 77) * k > Vi{ 1 + —), and thus y i+ i - y* > 

2 fa 9k 

w{[yi,yj+ 1 ], [l, yi] U [yi+i,n]) 

Vi+i ~ yi 

w([l,yj], [l, Vi]) + w([l,y i+1 ], [l,yi+i]) 

Vi+i ~ Vi 

29y i+ i + 2% 

Vi +1 - yi 

29(1 + —^—). 
yi+i - yi 

Using the lower bound on yj + 1 — yi, we have 

<j>([yuyi+ 1]) <2 9 + ^- < 4 > k , 

where the last inequality is by our assumption that 9 < (fk /4. □ 

2.2.2 Induction 

In the following, we assume that </>| > 1024A, as otherwise Theorem 2 
the following lemma by induction. 

Lemma 4. If </>| > 1024A and 4>(rrii) > 256k\/(fk for all mi < n/2 
mi < n/2. 

As argued before, the assumptions of Lemma 4 would imply that x* > 0 for all 1 < i < n/2, leading 
to a contradiction. So, the assumptions of Lemma 4 must not hold, and thus there is an nij with 
(j>(mi) < 256/cA/^fc, proving Theorem 2. 

To prove Lemma 4, we will prove by induction that x mi < CiX m . where initially co = 2 and 

Ci if (j>(mi) > (f k / 4, 

Ci/(1 - ejC») if < 4>k/ 4, where e* = 2A /<t>(mi). 



holds trivially. We will prove 
, i/ien x mi+1 < 4x mi+1 /or all 


The expansion of the set [y*, y*+i] is 
4>{[yi,yi+ 1 ]) = 
< 

< 
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We first assume this induction step and show that c 00 < 4 using Lemma 3. Then we will verify the 
induction step. 

Claim 2. c 00 < 4. 


Proof. First, we prove by induction that 


Ci = 


co 


1 - £i=o e i c o 


Assume this is true for i. Then 

Ci 


Ci~l~ 1 — 


= ( 


Co 


-)( 


Co 


1 e*Cj l J2j=o e j°o -*■ (i_5TTo e j c o ) £j=o e i c o 


Next, we bound Cqo using Lemma 3. Recall that e* = 2A /farrii) and we can assume that farrii) > 
256k\/((>k- Let 9q = 256k\/4>k and 0j+i = 20*. By Lemma 3, there are at most 16 k/fa terms m* 
in the jumping sequence with 6 < ct>(rrii ) < 20 when 0 < fa/ 4. Therefore, 


E 


e 3 


E E * 

2>0 j:6i<(p(mj)<20i 


^e £ f ^ 

2>0 j:0i<(j){rnj)<.20i 


E 

i>0 


16/c 2A 

0 k 0/ 


E 


32A;A fk 
fa 256fcA2* 


1 

4' 


Therefore, 


Coo — 


co 


1 - £j CjCo 


< 


Co 


1 _ 

1 4 


= 4. 


□ 


We prove the induction step. There are two cases, depending on whether farrii) < fa/A. We first 
consider the case when farrii) < fa/4:. In this case, just apply equation (2.1) and we have 

2\CiX mi . ^^_/l — CiCis x m i+ 1 

x m,i +1 E X mi — x mi\ 1 CjC*) E x nii ( ) E j 

farrii) Ci Cj+i 


by the definition of e* and Cj+i and we are done in this case. 

It remains to consider the case when farrii) E fa/4. By induction, we assume that x mi < c*x mi , 
and we claim that x m . i+1 < CiX mi+1 . By the same calculation as in the induction for Theorem 1, we 
have 

mi + i mi rrii + 1 

Xi = 22 Xi + 22 Xi ~ X m. l ( m i+ 1 + (Cj - l)m*). 

2— 1 2=1 z=mi + l 

Similarly, dividing both sides of this inequality by m*+1, we have 


x m i+1 < x m i+1 , 1 + ± {mi) 


l( 


c* + \ farrii) farrii ) 


)( 


fam/) ~ 2Aq 


) < CjX 


"U+l j 


where the last inequality follows from Claim 1 by plugging in /i = farm)/ 2, ip = farrii), c = c* and 
checking that the conditions 2 < c < 4 (Claim 2), hip > ip 2 /2 > fa/32 > 32A and </? E 32A are 
satisfied by our assumptions. This completes the induction step and thus the proof of Lemma 4. 
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3 Personal Pagerank 


We show that a similar and simpler analysis applies to the personal pagerank vector. Given a 
parameter a € (0,1] and a vertex s, the personal pagerank vector r S)Q € M n is the unique solution 
to the equation r sa = a\ s + (1 — a)Wr Sj0l , where W is the transition matrix of the lazy random 
walks. Note that r SjQ is a probability distribution vector. In the following, we assume S is an 
unknown target set with 3|5| log(|Sj) < n. 


3.1 Drop Lemma 


Let x := r St0i be the personal pagerank vector and assume x\ > X 2 > ... 
Chung proved a drop lemma for pagerank vectors (see Lemma 1 of [AC07] 
Lemma 1), for 1 < a < b < n, 

a 

X a Xb E 7TZ i 77 TT* 

ra([l,aj, [ b , n\) 


> x n . Andersen and 
and compared to our 

(3.1) 


3.2 Escaping Probability 


Let S be an unknown target set. Using a bound on the escaping probability of random walks [ST13] 9 , 
Andersen and Chung proved that for half of the vertices s in S, the personal pagerank vector 
x := r Sta will have the property that (see Lemma 5 of [AC07]) 


yi Xj > i - 


a 


(3.2) 


Setting a = 3 (j>(S) makes sure that Ylies x i — 2/3 and it follows that (see Lemma 2 of [AC07]) 
there exists an a < |i>| with 

2 

3alog(|S|)' 


3.3 Vertex Expansion 

For vertex expansion, we start our jumping sequence by setting mo = a and then define 

m i+ 1 = \rrii(l + <j> v (G))]. 

By this definition, we have w([l,m,i], [mi+i,n]) > \mi ■ and it follows that 

2a , , 2a 


x m i+ 1 A Xrrii ,, 

mi ■ 4>{mi 


and x moo > x a - 


i>0 


mi ■ 4>(mi)' 


Suppose by contradiction that 4>(mi) > 36cj)(S) log(|5|)/^(G) for all rn t < 3|5| log(|5|). Then 

2a . nt- 2 acj) V (G) . 1 


E 


< 


E 


< 


i>0 m i ■ H m i) ^ 36a(l + 0 y (G)) i </>(5)log(|S'|) 3alog(|5|)’ 


9 Actually, using a stronger result by Oveis Gharan and Trevisan [OT12], one can show that x(S) > , 

but it does not change the results in the following subsections. 
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where the last inequality uses the bound that ^- >0 1/(1+^)^ (G)) 1 < (l+(/> v (G))/(j) V (G) < 2/(j) V (G) 
and our choice that a = 3 <fi(S). This implies that 


1 

X3|S|log(|S|) - 3alog(|S|) 


and thus 


J2J2 


i 


j> o 


0<j<3|S| log(|S|) 


3a log(IS'l) 


> 1, 


since a < |Sj, contradicting that x is a probability distribution vector. Therefore, there must exist 
an mi < 3|Sj log(|5|) with 4>{rrii) < 36cj)(S)log(\S\)/(j) V (G), proving the first part of Theorem 3. 


3.4 A-way Expansion 

For k -way expansion, we define the jumping sequence by setting mo = a and 

m i+ 1 = |"m;(l + 0(mj))]. 

As before, we have «;([!, m*], [rrii + i,n]) > \rrii ■ cj)(mi), and it follows that 


2a 


%mi-L -1 ^ %rrii ( \ 

+ mi ■ 


and x moo > x a - 


2 a 


We divide the summation into two parts 


E 


2a 


+ E 


i>0 


2a 


rrn ■ (j>(mi)' 


rrij • dirrij ) ^ m; • (birrij) 

v.(j>{mi)<(j> k /A i-<t>{mi)>(j > k /4 


The second part is at most 


E 2 a ^ v -v 
m ■ t IA E_ 


8a 


< 


64a 


4 ^ a^ fe (l + 0fe/4)* a(j) 2 k ' 

The first part can be bounded by Lemma 3 as follows. Suppose by contradiction that ^(m*) > 
1152k(j)(S) log(|Sj)/0fc for all rrn < 3|<Sj logd<Sj)- Let 9q = 1152k(j)(S) logd<Sj)/</>fc and 9i = 26A_i for 
i > 1. By Lemma 3, there are at most 16 k/cf>k terms in the jumping sequence having conductance 
between 9 and 29 when 9 < 4> k /4. Therefore, the first part is at most 

v ~\ v 2a ^ ^—> 32 k a ^—», 32 k a 64Aa 

2-^ 2—d a(j){m A _ rh,. nfij E— 1 


j i: 6 j<(j)(mi )<26 

Putting these back into the first inequality, we have 

64a 64Aa 
> X n ---— > 


-j' 4>k a9j z --' 4>k a2Wo cuj>k9 o 


192 <j>(S) 192 0(S)fc 


oo _ 


cuti a4> k 9 0 SalogdS'l) acjrl 


acf>k9o 


> 


384 (/>(S)k 


> 


3alog(|Sj) ac/) k 9o Salogd^l)’ 
where the second inequality is by the lower bound of x a and the choice of a = 3 4>(S), and the last 
inequality is by our choice of 9q. This implies that 

> 3a i Q g ( |g|) E *12 E 3 alog(|S|) > *■ 

6VI u j> 0 0<i<3|S|log(|S|) &VI U 

since a < |Sj, contradicting that x is a probability distribution vector. Therefore, there must exist 
an rrii < 3|log(|5|) with rj)(mi ) < 1152 kc/)(S) log d £j )/</>&, proving the second part of Theorem 3. 
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3.5 Local Algorithm 


Andersen and Chung [AC07] show that the drop lemma (equation (3.1)) still holds even for ap¬ 
proximate personal pagerank vectors, which can be computed efficiently in unweighted graphs. In 
the following, we assume the graphs are unweighted d-regular (in our setting, the edge weights 
are either 1/d or 0). An e-approximate vector for r s>a is a vector r' sa that satisfies r' s a = 
a (Xs — q) + (1 — a)Wr' sa where the vector q is non-negative and satisfies q(u) < e for every 
vertex u in the graph. 


Lemma 5 ([AC07]). There is an algorithm that computes an e-approximate vector r' SOL . 
running time of the algorithm is 0(d/(ea)). Assume r' s a ( 1) > r' s a ( 2) > ■■■ > r' sa (n). 
approximate vector r' s a satisfies for any 1 < a < b < n, 


r 's, a ( a ) - r 's,a ( b ) < 


a 


w([!,a], [b, re]) 


The 

The 


Note that 0 < q < el implies 


r s,a ~ r' s>a = a(I - (1 - a)W) (% s - (% s - q)) = a(I - (1 - a)W) 1 q < ea(I - (1 - a)W) 1 = el, 

where the last equality holds since 1 is an eigenvector of both I and W with eigenvalue 1. Hence 
for any vertex u, we have r' sa {u ) > r S:0l (u ) — e. We set e = 1/(6|5|) and a = 3(j>(S) so that 

r 'sA^ ^ - e \ s \ ^ 1 _ - 


a 


6 ~ 2 ’ 


ies ieS 

for those vertices s that satisfy equation (3.2). Hence there exists an a < 151 with 

r'sM ^ 


2a log |5| 


Since r' s a satisfies the drop lemma (equation (3.1)) and contains good initial value, both arguments 
in vertex expansion and k-way expansion follow (with the assumption 3|<S| log(IS'l) < n replaced 
by 6|5| log(IS'l) < n). The runtime of this algorithm is dominated by the runtime for computing 
the approximate vector p' st and sorting at most 0(|5| log(|5|)) vertices after, and hence the total 
complexity is 0(d/(ea) + |S| log 2 (|S'|)) = 0(d\S\/<f>(S) + |5| log 2 (|5|)). 


4 Random Walks 

In this section, we present a spectral analysis of the random walk local graph partitioning algo¬ 
rithm [ST13, KL12]. The proof consists of three steps. The first step is to show that the Rayleigh 
quotient of the random walk vector p Sj t = W t Xs is small, by using the analysis in the power method. 
The second step is to show that the ||p S) t ||2 is large for many vertices s in the unknown target set, 
by using the bound on escaping probability (or the staying probability). This allows us to apply 
the argument in [ABS10] to p St t to obtain a vector with small Rayleigh quotient and small support. 
Then we apply the improved Cheeger’s inequality to prove Theorem 4. Finally, we show that the 
truncated random walk vectors would also work, thereby proving a local implementation of the 
algorithm. 
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4.1 Rayleigh Quotient 


Recall that the Rayleigh quotient of a vector x is defined as IZ(x) = x T Lx/\\x\\ 2 . The following 
lemma shows that the Rayleigh quotient of the vector p s j := W t Xs becomes smaller when t becomes 
larger. The proof follows the analysis of the power method in computing the largest eigenvector. 

Lemma 6. For any starting vertex s, 


n(p s> t)<2-2\\p Stt \\ 1 2 /t 


Proof. Let Xs = ^- l =1 CjUj where V{ are eigenvectors of L. Note that the lazy random walk matrix 
is W = I — L/2, and thus the vector p st = W t Xs = Y17= i c *(l — Aj/2) t Uj. Hence, the Rayleigh 
quotient of p Si t is 


T^{p s ,t) 


EE^C-a^a, - Xdd-W2) 2W 
lb*,ill 2 Efc, o?(i - A,/2) 2 ‘ - E?,U 2 (1 - a./2)=‘ ' 


Note that JT c 2 = [|Xs 111 = 1) and thus c? can be viewed as a probability distribution. Let X 
be the random variable having value 1 — Aj/2 with probability cf. Then we can write lZ(p s> t) = 
2 — 2E[Al 2 * +1 ]/E[X 2 *]. By the power mean inequality and the non-negativity of X , we have 


E[X 2t+1 ]V( 2t+ B > EfX 2 *] 1 /^. 


Hence 


ft(p Sit ) < 2 - 2E[X 2t ] 1 / 2t = 2-2 ( J^c 2 (! - \i/2) 2t 


1 / 2 1 


— 2 2||p S) t 


ii/t 
12 ' 


, j=l 


□ 


4.2 Small Support Vector with Small Rayleigh Quotient 

A vector x is called spectrally 5-sparse if ||x|| 2 < <5n||x||2. First, by using a result by Oveis Gharan 
and Trevisan on escaping probability (or staying probability), we bound the spectral sparsity of the 
random walk vector. Then, we use a result used by Arora, Barak and Steurer to turn a spectrally 
sparse vector into a small support vector with similar Rayleigh quotient. 

The following lemma by Oveis Gharan and Trevisan shows that if 4>(S) is small, there is a large 
subset U C S, such that the random walk starting at any vertex s E U stays entirely inside S with 
good probability. In particular, the probability that the walk ends inside S is large. 


Theorem 5 ([OT12]). For any subset S C V, there is a subset U C S, such that \U\ > \S\/2, and 
for any s € U we have 


5 2p»A v ) 

veS 


1 

“ 200 



t 


This provides a bound on the spectral sparsity of p s j- 
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Lemma 7. For any subset S C V, there is a subset U C S such that \U\ > |S|/2, and for any 
s € U we have 


„ l|2 . 40000|5| ||2 

H^ll 1 “ (1-3 <j>(S)/2) 2t ^ Ps,t ^ 2 


Proof. By Cauchy-Schwarz and Theorem 5, we have 

IlftJl > E2 ]if fe2 IS? (m f 1 - ^T 1 ) 


v£ S 


\v£ S 


|5| l 200 


2 J 


40000|5| 


Since ||XsIIl = 1 and bF preserves 1-norm, we have ||p S) t|| 2 = ||hh*x s || 2 = 1, and the 


ms)y 

result follows. 

□ 



The following lemma in [ABS10] shows how to obtain a vector y with small support and similar 
Rayleigh quotient from a spectrally <5-sparse vector x. The proof is by choosing an appropriate 
threshold t and set y = max(i — t, 0). 

Lemma 8 ([ABS10]). Let x € M^J be a non-negative vector with ||x|| 2 < <577,||a;|||. Then there exists 
a vector y with supp(y) = 0(Sn) and 7 Z(y) = 0(TZ(x)). 

We will apply Lemma 8 on p s j and obtain a vector with small Rayleigh quotient (Lemma 6) and 
small support (Lemma 7). 


4.3 Improved Cheeger’s Guarantees 


We are ready to prove Theorem 4. In the following we assume f(S) <1/4 and IS 1 ! > 2. We set 
t = elog |S|/(6</>(S)) so that 

> exp(-3</(S)) 24 = exp(—6 t<f>(S)) = exp(-elog \S\) = \S\~ e . 


1 - 


By Lemma 7, we have 


On the other hand, since \S\ > 2, 


||p s ,t||^ < 40000|S'| 1+e |b s , t ||| 


(4000015*1) 


— 1 /( 2 1 ) 


> exp — 


17 log | S\ 
2 1 


= exp 


51^(5) ^ > : 510(5) 


Therefore, by Lemma 6, we have 


n M < 2-2(1 -Ma )( i_51M ) = 0 


Mj. 


Now, we apply Lemma 8 by plugging the vector p S) t for x and obtain a vector y with supp(y) < 
0(15*| 1+e ) and TZ(y) = 0((j>(S)/e). Finally, by the proof of the improved Cheeger’s inequality (1.2) 
(see Section A in Appendix), we find a level set S' with \S'\ < |supp(y)| = 0(|S'| 1+e ) and 


= or «S') = o(M2). 


V e 4>k J 


V J 


Since a level set of y is a level set of p s t, this proves the approximation guarantee of Theorem 4. 
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4.4 Local Algorithm 


Computing the vector p Sj t = W t Xs exactly requires at least linear time. In the following, we assume 
the graph is an unweighted 4-regular graph (in our setting, the edge weight is either 1/4 or 0). To 
obtain a local algorithm, we can compute a good approximation to p s j by repeatedly applying the 
operator W (initially we compute W\s) an d truncating the small values to zero. 

Lemma 9 ([ST13, KL12]). Let p S}t = I V t x s be the exact random walk vector starting at vertex s. 
There is an algorithm that compute a vector p' s t such that p s j > t > p s ,t — a 1 and p' st > 0 in 
time 0(dt 2 /a). 


We set t = elog |S'1/0(5) and a = 0(S)/(16OOOO|S| 1+e ), so that the time complexity of our local 
algorithm is 0(de 2 \S\ 1+e log 2 |S|/0(S) 3 ). It remains to show that p' st is still spectrally sparse and 
has small Rayleigh quotient. 


Lemma 10. For p S: t that satisfies the conclusion in Lemma 7, we have 


\\Ps 


’s.tlll ^ 


1 

80000|S| 1+e 



2 

2 - 


Proof. In the proof, we let x := p Sj t and y := p' st . By Lemma 9, we have y{i) 2 > x{i) 2 — 2 ax(i) 
since y(i) > max(x(i) — a, 0). Therefore, 


Iblll = >^x{i) 2 - 2 a^x(i) = \\x\\l - 2a. 

i i i 

By Lemma 7, we have ||x ||2 > l/(40000|S| 1+e ). From our choice of a, we have 2a = 0(S)/(8OOOO|S| 1+e ) < 
0 (S)||.t|| 2/2. Therefore, 

II^H 2 ~ 2a ~ H x ll 2 i 1 2 ) “ 80000|S| 1+e “ 80000|S| 1+e ^^’ 
where the last inequality holds as||y|| 3 <||x||f = l. □ 

Lemma 11. 

W,f) < o(M). 


Proof. Again, we let x := p s ,t and y := p' s t in the proof. Let r = x — y > 0. Then we have 
y T Ly (x — r) T L(x — r) x T Lx + r T Lr — 2x T Lr 2x T Lx + 2r T Lr 

"Ml/) j' 7“ 7^ — j' • 

y y y 1 y y 1 y y 1 y 

By the calculation in Lemma 10, \\y\W > (1 — 0(S)/2)||x|||. Hence, using r > 0 and y > 0, we have 
Drill < II^Hl — \\y\\l — 0(5*)H^lll/2 and r T Lr < 2r T r < 0(S)||.t|||. So, we have 


K{y) = O 


f x t Lx 

V y T y + 



o 


( x t Lx 
\ x T x 


r T Lr\ 
H- f — ) 

x 1 x J 


0(K{x) + 0(S)) = O 



□ 
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With Lemma 10 and Lemma 11, we can use the same proof in Section 4.3 to prove Theorem 4 with 
the time complexity claimed. 

To prove Theorem 4(2), we only need to set e = 1 /log(|/S'|) so that |5'| 1+e = 0(151). 

4.5 Local Eigenvalue 

We remark that if we do not care about local implementations, we can find a particular good 
starting vertex u such that the random walk algorithm starting at u gives a better performance 
guarantee ^>(5') = 0(k\s /(e</>fc)), where A s is the smallest eigenvalue of the matrix Ls which is the 
restriction of L on the subset S. Chung [Chu07] shows the following local Cheeger’s inequality: 

A s < mind^T) < y / 2A s- 
TCS 

Hence A s is at most 4>(S) and could be much smaller, for instance when a subset of S has very 
small expansion. The idea is similar to that in [KL12] and we just give a quick sketch. Let 
vs be the corresponding eigenvector with eigenvalue A s- We choose our starting vertex to be 
u = argmaXj |t>s(i)|. Then we show that the spectral sparsity of the f-steps random walk is at most 
|S|/(1 — As) 2t < |5|/(1 — 0(cj)(S))) 2t . This allows us to set t to be larger so as to improve the 
Rayleigh quotient of the random walk vector. 
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A Relations between Improved Cheeger’s Inequality and Theo¬ 
rem 2 

First, we show that Theorem 2 can be derived from the proof of the improved Cheeger’s inequality, 
as pointed out to us by Luca Trevisan. Then, we show that the improved Cheeger’s inequality can 
be derived from Theorem 2, using a graph powering trick as described in [KL14]. 


A.l Improved Cheeger’s inequality implies Theorem 2 

The following stronger statement was shown in [KLLOT13]. 

Theorem 6 (Theorem 3.5 of [KLLOT13], restated). For any non-negative vector x with supp(x) < 
n/2, let 4> S weep(x ) be the minimum expansion of the level sets of x. At least one of the following 
holds: 


1- 4 > sweep(,x') A o(k)n(x). 

2. There exists k disjointly supported vectors x\,Xk such that for all 1 < i < k, supp(xj) C 
supp(.x) and TZ(xi) < 0{k 2 fZ{x) 2 / (f sweep (x) 2 ). 


We apply the theorem with x = max(u 2 , 0) or x = max(— v 2 , 0), whichever of smaller support. Note 
that TZ(x) < A2 by standard argument [HLW06]. When the first case of Theorem 6 holds, it is clear 
that 

4>(G) < </Weep(*) < 0(k)K(x) < 0(k \ 2 ) < O(^). 

Otherwise, there exist k disjointly supported vectors, each with Rayleigh quotient not larger than 
0(fc 2 A|/4weep (x)). Apply (the original) Cheeger’s arguments on these vectors, we can find k 
disjoint subsets Si, each of them satisfies cj>(Si) < 0(k\ 2 / 4> sweep (x)) . This implies that 


A. <r n(. 




and Theorem 2 follows. 
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A.2 Theorem 2 implies improved Cheeger’s inequality 


In [KL14], the authors proved a lower bound on the expansion of graph powers and used it to show 
some reductions on Cheeger’s inequalities. We show that the same approach can be used to prove 
improved Cheeger’s inequality by Theorem 2. 

Theorem 7 (Theorem 1 of [KL14], restated). Let H denote the graph with adjacency matrix W l 
where W is the lazy random walk matrix of G. Then we have 

m) > 4(i - (i - ^4)' /I ). 


The following corollary is a generalization of Corollary 12 of [KL14], which shows that general cases 
of improved Cheeger’s inequality can be reduce to the cases where is constant. 

Corollary 3. Suppose one could prove that 4>(H) < C\ 2 {H) for some C > 1/10 whenever A k{H) > 
1/4, then it implies that 4>(G) < 40CA2(G)/y / A k(G) for any G and any A k(G). 

Proof. We assume that 4>(G) < \fXf, as otherwise, by Cheeger’s inequality, 2A2(G) > 4>(G) 2 > 
(j)(G)\f\f and the statement is true. Consider H with adjacency matrix W l ^ k ^ G \ Then 

\ k (H) = 1 - (1 - )‘A. > 1 - exp(-t) > 1/4. 

Therefore, if one could prove that <f>{H) < then 


CX 2 (H) > > —(1 - (1 - > _L(i _ exp (-^£L)) > ^ 


20' v ~ 2 ' — 20 v 2^X^G)” ~ W^(G)' 

where the second inequality is by Theorem 7. On the other hand, 


and the corollary follows by combining the two inequalities. 


□ 


Now we show the improved Cheeger’s inequality in [KLLOT13] follows from Corollary 3 and The¬ 
orem 2. By the easy side of the higher order Cheeger’s inequality, we have </>& > A^/2. Hence, for 
any graph G with A & > 1/4, we have </>& > 1/8 and Theorem 2 gives <f(G) = 0{kX2{G)). Therefore, 
we can apply Corollary 3 (with C = 0(h)) and conclude that < j>(G) = 0(kX2{G)/yJX & (G )) is true 
for any graph G and any A&, and the improved Cheeger’s inequality in [KLLOT13] follows. 
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